A Framework for Representing and Reasoning about Three-Dimensional Objects for Vision
نویسندگان
چکیده
defined, the reasoning portion of the 3-D FORM system can apply the model to real data, recognizing instances of the defined objects and hypothesizing their missing parts. Figure 1 shows a portion of the knowledge that might be represented about a building. A BUILDING is a 3D-OBJECT; and a WALL, which is a 2D-OBJECT, is one of the building's parts. The specific building, BLDG1, has parts WALL1 and WALL2, whose geometric features are the primitive geometric objects PLANE1 and PLANE2, respectively. In addition to its geometric feature, WALL1 has the photometric feature COLOR1. The knowledge that a building's walls are mutually perpendicular is represented for BLDG1 by the instance PRPW12 of the PERPENDICULAR-PLANES relationship. The arguments to PRPW12 are PLANE1 and PLANE2, the geometric features of BLDG1's walls. The geometric feature PLANE1 is supported by the data object REGION1, which came from an image whose projection between 2-D and 3-D is CAM1. In the current implementation, only 3-D objects, geometric features, and geometric relationships are represented. The 3-D FORM system was applied to model-based interpretation of 3-D data and to solutions of sets of geometric constraints. This article discusses some of the issues in geometric reasoning for knowledge-based vision systems and how some existing systems have addressed them. Next, it describes the representations of primitive geometric objects, geometric relationships, and composite objects in the 3-D FORM system. Finally, it presents examples of the system's application to geometric reasoning tasks. e are developing the Framebased Object Recognition and Modeling (3-D FORM) System, a framework for representing and reasoning about three-dimensional (3-D) objects. This framework incorporates representations of objects, representations of the relationships between them, and a geometric reasoning capability. Such a representation and reasoning capability is essential for knowledge-based, 3-D photointerpretation systems that combine domain knowledge with image processing, as demonstrated by 3-D Mosaic (Herman, Kanade, and Kuroe 1984; Herman and Kanade 1986) and ACRONYM (Brooks 1981). It is also required for other applications, such as robot navigation, 3-D change detection, and the simulation of a scene's appearance from arbitrary viewpoints. The 3-D FORM system uses frames to represent objects such as buildings and walls, geometric features such as lines and planes, and geometric relationships such as parallel lines. Active procedures attached to the frames dynamically compute values as needed. Because processing is controlled by slot access, the system can hypothesize new objects or compute and verify relationships between existing objects, depending on the currently available knowledge. The 3-D FORM system can include knowledge about model and data objects organized into IS-A and PART hierarchies, along with relationships between object features, and projections used to convert between model objects and data objects. The knowledge includes both generic object models and specific instances of objects. Once the generic objects and the relationships between them are The capabilities for representing and reasoning about three-dimensional (3-D) objects are essential for knowledgebased, 3-D photointerpretation systems that combine domain knowledge with image processing, as demonstrated by 3D Mosaic and ACRONYM. Three-dimensional representation of objects is necessary for many additional applications, such as robot navigation and 3-D change detection. Geometric reasoning is especially important because geometric relationships between object parts are a rich source of domain knowledge. A practical framework for geometric representation and reasoning must incorporate projections between a two-dimensional (2-D) image and a 3-D scene, shape and surface properties of objects, and geometric and topological relationships between objects. In addition, it should allow easy modification and extension of the system's domain knowledge and be flexible enough to organize its reasoning efficiently to take advantage of the current available knowledge. We are developing such a framework—the Frame-based Object Recognition and Modeling (3-D FORM) System. This system uses frames to represent objects such as buildings and walls, geometric features such as lines and planes, and geometric relationships such as parallel lines. Active procedures attached to the frames dynamically compute values as needed. Because the order of processing is controlled largely by the order of slot access, the system performs both top-down and bottom-up reasoning, depending on the current available knowledge. The FORM system is being implemented with the Carnegie-Mellon University-built Framekit tool in Common Lisp (Carbonell and Joseph 1986). To date, it has been applied to two types of geometric reasoning problems: interpreting 3-D wire frame data and solving sets
منابع مشابه
Future study of Description System Architecture Approaches with Emphasis on Strategic Management
Systems Architecture is a generic discipline to handle objects (existing or to be created) called systems, in a way that supports reasoning about the structural properties of these objects. Systems Architecture is a response to the conceptual and practical difficulties of the description and the design of complex systems. Systems Architecture is a generic discipline to handle objects (existin...
متن کاملDeveloping 3 dimensional model for estimation of acoustic power in urban pathways in geo-spatial information system framework
Around the word, traffic growth is causing growing air and noise pollution. Noise levels in a given area are affected by traffic on the streets as well as effective factors, including existing infrastructure and industrial centers, and so on. The purpose of this research is to model and estimate the amount of acoustic emission in the streets of Tehran's third district, using the 3D spatial info...
متن کاملThree dimensional interpretation of an indoor scene from a single image
One of the main goals of computer vision has been understanding the overall scene and the various components in the scene from an image. In order to guarantee that the resulting output is valid in three dimensions, three dimensional reasoning must be done. My previous work on reconstruction of indoor building structures and detecting cuboid and rectangular objects have made use of strict three ...
متن کاملExtending the Qualitative Trajectory Calculus Based on the Concept of Accessibility of Moving Objects in the Paths
Qualitative spatial representation and reasoning are among the important capabilities in intelligent geospatial information system development. Although a large contribution to the study of moving objects has been attributed to the quantitative use and analysis of data, such calculations are ineffective when there is little inaccurate data on position and geometry or when explicitly explaining ...
متن کاملContext-aware Modeling for Spatio-temporal Data Transmitted from a Wireless Body Sensor Network
Context-aware systems must be interoperable and work across different platforms at any time and in any place. Context data collected from wireless body area networks (WBAN) may be heterogeneous and imperfect, which makes their design and implementation difficult. In this research, we introduce a model which takes the dynamic nature of a context-aware system into consideration. This model is con...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- AI Magazine
دوره 9 شماره
صفحات -
تاریخ انتشار 1988